A Primer on Molecular Similarity in QSAR and Virtual Screening Part III – Connecting descriptors and experimental measurements – model generation

نویسنده

  • Andreas Bender
چکیده

The generation of models relating structure and some form of measurement of structural properties, be it bioactivity or any physicochemical property, consists of the description of the problem, the choice of a dataset of experimental endpoints, and the construction of the actual mathematical model. The descriptor generation was discussed in the first part of this primer, with the conclusion to choose descriptors relevant to the problem and of not overly complex nature. In the second part we were discussing experimental endpoints that can be used as “output variables” of models. Here the conclusion was that also experimental data are not without fault – data measured in a single laboratory may show significant differences (e.g. in high-throughput screenings), and combining results from different labs might be even more error-prone. Therefore, a clean dataset, measured by a single, well-defined experimental procedure should be used in the ideal case. In the current, concluding part of the primer I would like to discuss the basics of constructing a useful mathematical model that connects molecular descriptors as an independent variable, and outputs (predicted) properties as the dependent variable. The focus of this part of the primer will be on two steps commonly performed in the generation of QSAR/QSPR models, namely feature selection and model validation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

QSAR Comparative Study of Wiener Descriptors for Weighted Molecular Graphs

Quantitative structure-property relationship (QSPR) and quantitative structure-activity relationship (QSAR) studies use statistical models to compute physical, chemical, or biological properties of a chemical substance from its molecular structure, encoded in a numerical form with the aid of various descriptors. Structural indices derived from molecular graph matrices represent an important gro...

متن کامل

New QSAR prediction models derived from GPCR CB2-antagonistic triaryl bis-sulfone analogues by a combined molecular morphological and pharmacophoric approach.

In order to build quantitative structure-activity relationship (QSAR) models for virtual screening of novel cannabinoid CB2 ligands and hit ranking selections, a new QSAR algorithm has been developed for the cannabinoid ligands, triaryl bis-sulfones, using a combined molecular morphological and pharmacophoric search approach. Both pharmacophore features and shape complementarity were considered...

متن کامل

The great descriptor melting pot: mixing descriptors for the common good of QSAR models

The usefulness and utility of QSAR modeling depends heavily on the ability to estimate the values of molecular descriptors relevant to the endpoints of interest followed by an optimized selection of descriptors to form the best QSAR models from a representative set of the endpoints of interest. The performance of a QSAR model is directly related to its molecular descriptors. QSAR modeling, spec...

متن کامل

Quantitative Structure Activity Relationship Analysis of Coumarins as Free Radical Scavengers by Genetic Function Algorithm

The antioxidant properties of coumarin derivatives using the 2,2ˈ -diphenyl-1- picrylhydrazyl (DPPH) radical scavenging assay were investigated by the application of Quantitative Structure Activity Relationship (QSAR) studies. The molecular structures were optimized and submitted for the generation of quantum chemical and molecular descriptors. Genetic Function Algorithm (GFA) was employed in m...

متن کامل

In-silico prediction of Cellular Responses to Polymeric Biomaterials from Their Molecular Descriptors

In this work quantitative structure activity relationship (QSAR) methodology was applied for modeling and prediction of cellular response to polymers that have been designed for tissue engineering. After calculation and screening of molecular descriptors, linear and nonlinear models were developed by using multiple linear regressions (MLR) and artificial neural network (ANN) methods. The root m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007